Fully On-Chip MAC at 14 nm Enabled by Accurate Row-Wise Programming of PCM-Based Weights and Parallel Vector-Transport in Duration-Format

نویسندگان

چکیده

Hardware acceleration of deep learning using analog non-volatile memory (NVM) requires large arrays with high device yield, accuracy Multiply-ACcumulate (MAC) operations, and routing frameworks for implementing arbitrary neural network (DNN) topologies. In this article, we present a 14-nm test-chip Analog AI inference—it contains multiple phase change (PCM)-devices, each array capable storing 512 $\times $ unique DNN weights executing massively parallel MAC operations at the location data. excitations are transported across chip duration representation on reconfigurable 2-D mesh. To accurately transfer inference models to chip, describe closed-loop tuning (CLT) algorithm that programs four PCM conductances in weight, achieving <3% average weight-error. A row-wise programming scheme associated circuitry allow us execute CLT up concurrently. We show test can achieve near-software-equivalent two different DNNs. demonstrate tile-to-tile transport fully-on-chip two-layer MNIST (accuracy degradation ~0.6%) resilience error propagation long sequences (up 10 000 characters) recurrent short-term (LSTM) network, off-chip activation vector-vector generate inputs used next on- MAC.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Row-wise parallel predicate evaluation

Table scans have become more interesting recently due to greater use of ad-hoc queries and greater availability of multicore, vector-enabled hardware. Table scan performance is limited by value representation, table layout, and processing techniques. In this paper we propose a new layout and processing technique for efficient one-pass predicate evaluation. Starting with a set of rows with a fix...

متن کامل

the comparative impact of prompts and recasts in processing instruction versus meaningful output-based instruction on efl learners’ writing accuracy

the purpose of the present study was to see which one of the two instruction-processing instruction (pi) and meaningful output based instruction (mobi) accompanied with prompt and recast- is more effective on efl learners’ writing accuracy. in order to homogenize the participants in term of language proficiency a preliminary english test (pet) was administrated between 74 intermediate students ...

Modeling and design of a diagnostic and screening algorithm based on hybrid feature selection-enabled linear support vector machine classification

Background: In the current study, a hybrid feature selection approach involving filter and wrapper methods is applied to some bioscience databases with various records, attributes and classes; hence, this strategy enjoys the advantages of both methods such as fast execution, generality, and accuracy. The purpose is diagnosing of the disease status and estimating of the patient survival. Method...

متن کامل

Bedload transport predictions based on field measurement data by combination of artificial neural network and genetic programming

Bedload transport is an essential component of river dynamics and estimation of its rate is important to many aspects of river management. In this study, measured bedload by Helley- Smith sampler was used to estimate the bedload transport of Kurau River in Malaysia. An artificial neural network, genetic programming and a combination of genetic programming and a neural network were used to estim...

متن کامل

Bedload transport predictions based on field measurement data by combination of artificial neural network and genetic programming

Bedload transport is an essential component of river dynamics and estimation of its rate is important to many aspects of river management. In this study, measured bedload by Helley- Smith sampler was used to estimate the bedload transport of Kurau River in Malaysia. An artificial neural network, genetic programming and a combination of genetic programming and a neural network were used to estim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Electron Devices

سال: 2021

ISSN: ['0018-9383', '1557-9646']

DOI: https://doi.org/10.1109/ted.2021.3115993